Figure 1 



Heteractis crispa chromoprotein wild type (base isoform) 

10 20 30 40 50 60 

5 1 ACCATTTGCTTTGGTTCCTTGGCAAACGAAAGTTTAGAACGAAAACTGACCCAAATTACA 
70 80 90 100 110 120 

TCTTCCTCCTGGATCCTTACCATGGCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATG 

MAGLLKESMRIKM 

130 140 150 160 170 180 

TACATGGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAAC 
YMEGTVNGHYFKCEGEGDGN 

190 200 210 220 230 240 

CCATTTACAGGTACGCAGAGCATGAGGATTCATGTCACCGAAGGGGCTCCATTACCATTT 
PFTGTQSMRIHVTEGAPLPF 

250 260 270 280 290 300 

GCCTTCGACATTTTGGCACCGTGTTGTGAGTACGGCAGCAGGACCTTTGTCCACCATACG 
AFDILAPCCEYGSRTFVHHT 

310 320 330 340 350 360 

GCAGAGATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACC 
AEIPDFFKQSFPEGFTWERT 

370 380 390 400 410 420 

ACAAC CTATGAAGATGGAGGCATT CTTACTGCT CATCAGGACAC AAGC CTGGAGGGGAAC 
TTYEDGGILTAHQDTSLEGN 

430 440 450 460 470 480 

TGCCTTATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATG 
CLIYKVKVLGTNFPADGPVM 

490 500 510 520 530 540 

AAGAACAAATCAGGAGGATGGGAGCCATGCACTGAGGTGGTTTATCCAGAGAATGGTGTC 
KHKSGGWEPCTEVVYPENGV 

550 560 570 580 590 600 

CTGTGTGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCTC 
LCGRNVMALKVGDRRLICHL 

610 620 630 640 650 660 

TATACTTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTT 
YTSYRSKKAVRALTMPGFHF 

670 680 690 700 710 720 

ACAGACATCCGCCTTCAGATGCCGAGGAAAAAGAAAGACGAGTACTTTGAACTGTACGAA 
TDIRLQMPRKKKDEYFELYE 

730 740 750 760 770 780 

GCATCTGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGATTGTTCCCAGTGACA 
ASVARYSDLPEKAN* 

790 800 810 820 830 840 

CCAGACTGCTGTCAGCTTTTGGTTAAAGCCCGAAAGACAAAAGGACATTTGTAGTTTAGT 

850 860 870 880 890 900 

TTATATTTCCCTTTCATTTGTGAATCAACATTGTACTCTCTGTAAACCTTTAAAATGCTC 

910 

CATTAAACCT 3' (SEQ ID NOs : 01 & 02) 



Figure 2 

Heteractis crispa chromoprotein wild type (second isoform) 

10 20 30 40 50 60 

5 ' ACCATTTGCTTTGGTTCCTTGGCAAACGAAAGTTTAGACGAAAACTGACCCAAATTACAT 

70 80 90 100 110 120 

CCTCCTGATCCTTACCATGGCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATGTACAT 

MAGLLKESMRIKMYM 

130 140 150 160 170 180 

GGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAACCCATT 
EGTVNGHYFKCEGEGDGNPF 

190 200 210 22 0 230 24 0 

TACAGGTACGCAGAGCATGAGGATT CATGT CAC CGAAGGGGCTC CATTAC CATTTGC CTT 
TGTQSMRIHVTEGAPLPFAF 

250 260 270 280 290 300 

CGACATTTTGGCACCGTGTTGTGAGTACGGCAGCAGGACCTTTGTCCACCATACGGCAGA 
DILAPCCEYGSRTFVHHTAE 

310 320 330 340 350 360 

GATTCCGGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACCACAAC 
IPDFFKQSFPEGFTWERTTT 

370 380 390 400 410 420 

CTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAACTGCCT 
YEDGG1LTAHQDTSLEGNCL 

430 440 450 460 470 480 

TATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATGAAGAA 
IYKVKVLGTNFPADGPVMKN 

4 90 50 0 510 52 0 53 0 54 0 

CAAATCAGAAGGATGGGAGCCATGCACTGAGGTGGTTTATCCAGATAATGGTGTCCTGTG 
KSEGWEPCTEVVYPDNGVLC 

550 560 570 580 590 600 

TGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCTCTATAC 
GRNVMALKVGDRRL I CHLYT 

610 62 0 63 0 64 0 650 66 0 

TTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTTACAGA 
SYRS KKAVRALTMPGFHFTD 

670 680 690 700 710 720 

CATCCGCCTTCAGATGCCGAGGAAAAAGAAAGACGAGTACTTTGAACTGTACGAAGCATC 
IRLQMPRKKKDEYFELYEAS 

730 740 750 760 770 780 

TGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGATTGTTCCCAGTGACACCAGA 

VARYSDLPEKAN* 

790 800 810 820 830 840 

CTGCTGTCAGCTTTTGGTTAAAGCCCGAAAGACAAAAGGACATTTGTAGTTTTAGTTTAT 

850 860 870 880 890 900 

ATTTTCCCTTTCATTTTGTGAATCAACATTGTACTCTCTGTAAACCTTTAAAATGCTCCA 

TTAAACCT 3' (SEQ ID NOs : 03 & 04) 



Figure 3 
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Figure 4 



Heteractis crispa fluorescent protein mutant C148S 

C148S according to GFP numbering 
C143S according to self-numbering. 



ATGGCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATG 
MAGLLKESMRIKM 

TACATGGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGGAAC 
YMEGTVNGHYFKCEGEGDGN 

CCATTTACAGGTACGCAGAGCATGAGGATTCATGTCACCGAAGGGGCTCCATTACCATTT 
PFTGTQSMRIHVTEGAPLPF 

GCCTTCGACATTTTGGCACCGTGTTGTGAGTACGGCAGCAGGACCTTTGTCCACCATACG 
AFDILAPCCEYGSRTFVHHT 

GCAGAGATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACC 
AEIPDFFKQS FPEGFTWERT 

ACAACCTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAAC 
TTYEDGGILTAHQDTSLEGN 

TGCCTTATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATG 
CLIYKVKVLGTNFPADGPVM 

AAGAACAAATCAGGAGGATGGGAGCCAAGCACTGAGGTGGTTTATCCAGAGAATGGTGTC 
KNKSGGWEPSTEVVYPENGV 

CTGTGTGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCTC 
LCGRNVMALKVGDRRLICHL 

TATACTTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTT 
YTSYRSKKAVRALTMPGFHF 

ACAGACATCGGCCTTCAGATGCCGAGGAAAAAGAAAGACGAGTACTTTGAACTGTACGAA 
TDIRLQMPRKKKDEYFELYE 

GCATCTGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGA 
ASVARYSDLPEKAN* 
(SEQ ID NOs: 05 & 06) 



Figure 5A 
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Figure 5B 
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Figure 6 



Heteractis crispa fluorescent protein mutant 44-9 

point mutations: A5S,T39A,C148S,L181H ? P208L,K211E according to GFP 
numbering 

A2S,T36A,C143S,L173H,P201L,K204E according to self- 
numbering. 

80 90 100 110 120 

TCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATGTACAT 
SGLLKESMRIKMYM 

130 140 150 160 170 180 

GGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAACCCATT 
EGTVNGHYFKCEGEGDGNPF 

190 200 210 22 0 230 24 0 

TGCAGGTACGCAGAGCATGAGGATTCATGTCACCGAAGGGGCTCCATTACCATTTGCGTT 
AGTQSMRIHVTEGAPLPFAF 

250 260 270 280 290 300 

CGACATTTTGGCACCGTGTTGTGAGTACGGCAGCAGGACCTTTGTCCACCATACGGCAGA 
DILAPCCEYGSRTFVHHTAE 

310 320 330 340 350 360 

GATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACCACAAC 
IPDFFKQSFPEGFTWERTTT 

370 380 390 400 410 420 

CTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAACTGCCT 
YEDGGILTAHQDTSLEGNCL 

430 440 450 460 470 480 

TATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATGAAGAA 
I YKVKVLGTNFPADGPVMKN 

490 500 510 520 530 540 

C AAATCAGGAGGATGGGAGC CAAGCACTGAGGTGGTTTATC CAGAGAATGGTGTC CTGTG 
KSGGWEPSTEVVYPENGVLC 

550 560 570 580 590 600 

TGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCACTATAC 
GRNVMALKVGDRRL I CHHYT 

610 620 630 640 650 660 

TTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTTACAGA 
SYRSKKAVRALTMPGFHFTD 

670 680 690 700 710 720 

CATCCGCCTTCAGATGCTGAGGAAAGAGAAAGACGAGTACTTTGAACTGTACGAAGCATC 
IRLQMLRKEKDEYFELYEAS 

730 740 750 760 

TGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGA 
VARYSDLPEKAN* {SEQ ID NOs : 07 & 08) 



Figure 7A 
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Figure 8 



Crispa 44-6 mutant possesses six amino acid substitutions vs. wild type: 
A2S/T36 A, A65E,C 1 43S,L 1 73H,P20 1L. 

TCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATGTACAT 
SGLLKESMRIKMYM 

GGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAACCCATT 
EGTVNGHYFKCEGEGDGNPF 

TGCAGGTACGCAGAGCATGAGGATTCATGTCACCGAAGGGGCTCCATTACCATTTGCCTT 
AGTQSMRIHVTEGAPLPFAF 

CGACATTTTGGCACCGTGTTGTGCGTACGGCAGCAGGACCTTTGTCCACCATAGGGCAGA 
DILAPCCAYGSRTFVHHTAE 

GATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACCAGAAC 
IPDFFKQSFPEGFTWERTTT 

CTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAACTGCCT 
YEDGGILTAHQDTSLEGNCL 



TATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATGAAGAA 
IYKVKVLGTNFPADGPVMKN 

CAAATCAGGAGGATGGGAGCCAAGCACTGAGGTGGTTTATCCAGAGAATGGTGTCCTGTG 
KSGGWEPSTEVVYPENGVLC 



TGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCACTATAC 
GRNVMALKVGDRRLI CHHYT 

TTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTTACAGA 
SYRSKKAVRALTMPGFHFTD 



CATCCGCCTTCAGATGCTGAGGAAAGAGAAAGACGAGTACTTTGAACTGTACGAAGCATC 
IRLQMLRKEKDEYFELYEAS 

TGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGA 
VARYSDLPEKAN* 



(SEQ ID NO: 09 & 10) 



Figure 10 



The amino acid sequence of FP10-crl is: 
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(SEQ ID 



NO:12) 

A "humanized" nt sequence encoding the above cr-1 mutant is: 



ATGGTGAGCGGCCTGCTGAAGGAGAGTATGCGCATCAAGATGTACATGGAGGGCACCGTGAACGGCCAC 
TACTTCAAGTGCGAGGGCGAGGGCGACGGCAACCCCTTCGCCGGCACCCAGAGCATGAGAATCCACGTG 
ACCGAGGGCGCCCCCCTGCCCTTCGCCTTCGACATCCTGGCCCCCTGCTGCGAGTACGGCAGCAGGACC 
TTCGTGCACCACACCGCCGAGATCCCCGACTTCTTCAAGCAGAGCTTCCCCGAGGGCTTCACCTGGGAG 
AGAACCACCACCTACGAGGACGGCGGCATCCTGACCGCCCACCAGGACACCAGCCTGGAGGGCAACTGC 
CTGATCTACAAGGTGAAGGTGCACGGCACCAACTTCCCCGCCGACGGCCCCGTGATGAAGAACAAGAGC 
GGCGGCTGGGAGCCCAGCACCGAGGTGGTGTACCCCGAGAACGGCGTGCTGTGCGGCCGGAACGTGATG 
GCCCTGAAGGTGGGCGACCGGCACCTGATCTGCCACCACTACACCAGCTACCGGAGCAAGAAGGCCGTG 
CGCGCCCTGACCATGCCCGGCTTCCACTTCACCGACATCCGGCTCCAGATGCTGCGGAAGAAGAAGGAC 
GAGTACTTCGAGCTGTACGAGGCCAGCGTGGCCCGGTACAGCGACCTGCCCGAGAAGGCCAACTGA 

(SEQ ID NO: 11) 



Alternative crl amino acid sequence 

MSGLLKESMRIKMYMEGTVNGHYFKCEGEGDGNPFAGTQSMRIHVTEGAPLPFAFDILAPCCEYGSRTF 
VHHTAEIPDFFKQSFPEGFTWERTTTYEDGGILTAHQDTSLEGNCLIYKVKVHGTNFPADGPVMKNKSG 
GWEPS TEVVYPENGVLCGRNVMALKVGDRRL I CHHYTS YRSKKAVRALTMPGFHFTDIRLQMLRKEKDE 
YFEL YE AS VAR YSDL P EKAN* SEQ ID NO: 14) 



Amino acid sequence encoding above alternative sequence: 

ATGGTGAGCGGCCTGCTGAAGGAGAGCATGCGCATCAAGATGTACATGGAGGGCACCGTGAACGGCCAC 
TACTTCAAGTGCGAGGGCGAGGGCGACGGCAACCCCTTCGCCGGCACCCAGAGCATGCGGATCCACGTG 
ACCGAGGGCGCCCCCCTGCCCTTCGCCTTCGACATCCTGGCCCCCTGCTGCGAGTACGGCAGCAGGACC 
TTCGTGCACCACACCGCCGAGATCCCCGACTTCTTCAAGCAGAGCTTCCCCGAGGGCTTCACCTGGGAG 
AGAAC CACCAC CTACGAGGACGGCGGCATCCTGACCGCC CAC CAGGACAC C AGC CTGGAGGGCAACTGC 
CTGATCTACAAGGTGAAGGTGCTGGGCACCAACTTCCCCGCCGACGGCCCCGTGATGAAGAACAAGAGC 
GGCGGCTGGGAGCCCAGCACCGAGGTGGTGTACCCCGAGAACGGCGTGCTGTGCGGCCGGAACGTGATG 
GCCCTGAAGGTGGGCGACCGGCGGCTGATCTGCCACCACTACACCAGCTACCGGAGCAAGAAGGCCGTG 
CGGGCCCTGACCATGCCCGGCTTCCACTTCACCGACATCCGGCTGCAGATGCTGCGGAAGGAGAAGGAC 
GAGTACTTCGAGCTGTACGAGGCCAGCGTGGCCCGGTACAGCGACCTGCCCGAGAAGGCCAACTGA 
(SEQ ID NO: 13) 



FIGURE 12 



Cr -44 9 -tandem (4-amino acid linker between monomers is in double underline) * 



1 A CCG GT C GCC ACC ATG GTG AGC GGC CTG CTG AAG GAG AGC ATG CGC 46 

1 Agel MVSGLLKESMR 11 

47 ATC AAG ATG TAC ATG GAG GGC ACC GTG AAC GGC CAC TAC TTC AAG TGC 94 

12 I KMYMEGTVNGHYFKC 27 

95 GAG GGC GAG GGC GAC GGC AAC CCC TTC GCC GGC ACC CAG AGC ATG CGG 142 

28EGEGDGNPFAGTQSMR 43 

143 ATC CAC GTG ACC GAG GGC GCC CCC CTG CCC TTC GCC TTC GAC ATC CTG 190 

44 IHVTEGAPLPFAFDIL 59 

191 GCC CCC TGC TGC GAG TAC GGC AGC AGG ACC TTC GTG CAC CAC ACC GCC 238 

60APCCEYGSRTFVHHTA 75 

^ 239 GAG ATC CCC GAC TTC TTC AAG CAG AGC TTC CCC GAG GGC TTC ACC TGG 286 

^ 76EIPDFFKQSFPEGFTW 91 

■fl 28 7 GAG AGA ACC ACC ACC TAC GAG GAC GGC GGC ATC CTG ACC GCC CAC CAG 334 

%I 92ERTTTYEDGGILTAHQ 107 

01 335 GAC ACC AGC CTG GAG GGC AAC TGC CTG ATC TAC AAG GTG AAG GTG CTG 3 82 

%j 108 DTSLEGNCLIYKVKVL 123 

^ 3 83 GGC ACC AAC TTC CCC GCC GAC GGC CCC GTG ATG AAG AAC AAG AGC GGC 430 

* 124 GTNFPADGPVMKNKSG 139 

^ 431 GGC TGG GAG CCC AGC ACC GAG GTG GTG TAC CCC GAG AAC GGC GTG CTG 478 

M» 140 GWEPSTEVVYPENGVL 155 

p 479 TGC GGC CGG AAC GTG ATG GCC CTG AAG GTG GGC GAC CGG CGG CTG ATC 52 6 

U 156 CGRNVMALKVGDRRL I 171 

527 TGC CAC CAC TAC ACC AGC TAC CGG AGC AAG AAG GCC GTG CGG GCC CTG 5 74 

172 CHHYTSYR SKKAVRAL 187 

575 ACC ATG CCC GGC TTC CAC TTC ACC GAC ATC CGG CTG CAG ATG CTG CGG 622 

188 TMPGFHFTDIRLQMLR 203 

62 3 AAG GAG AAG GAC GAG TAC TTC GAG CTG TAC GAG GCC AGC GTG GCC CGG 670 

204 KEKDEYFELYEASVAR 219 

6 71 TAC AGC GAC CTG CCC GAG AAG GCC AAC AGA TCT_CCC GGG ATG GTG AGC 718 

220 YSDLPEKANRSPGMVS 235 

719 GGC CTG CTG AAG GAG AGC ATG CGC ATC AAG ATG TAC ATG GAG GGC ACC 766 

236 GLLKESMRIKMYMEGT 251 

76 7 GTG AAC GGC CAC TAC TTC AAG TGC GAG GGC GAG GGC GAC GGC AAC CCC 814 

252 VNGHYFKCEGEGDGNP 267 

815 TTC GCC GGC ACC CAG AGC ATG CGG ATC CAC GTG ACC GAG GGC GCC CCC 862 

268 FAGTQSMR IHVTEGAP 283 



863 
284 



CTG CCC TTC GCC TTC GAC ATC CTG GCC CCC TGC TGC GAG TAC GGC AGC 
LPFAFDILAPCCEYGS 



910 
299 



Figure 12 (continued) 

911 AGG ACC TTC GTG CAC CAC ACC GCC GAG ATC CCC GAC TTC TTC AAG CAG 958 

300 RTFVHHTAEIPDFFKQ 315 

959 AGC TTC CCC GAG GGC TTC ACC TGG GAG AGA ACC ACC ACC TAC GAG GAC 1006 

316 SFPEGFTWERTTTYED 331 

1007 GGC GGC ATC CTG ACC GCC CAC CAG GAC ACC AGC CTG GAG GGC AAC TGC 1054 

332 GGILTAHQDTSLEGNC 347 

1055 CTG ATC TAC AAG GTG AAG GTG CTG GGC ACC AAC TTC CCC GCC GAC GGC 1102 

348 LIYKVKVLGTNFPADG 363 

1103 CCC GTG ATG AAG AAC AAG AGC GGC GGC TGG GAG CCC AGC ACC GAG GTG 115 0 

364 PVMKNKSGGWEPSTEV 379 

1151 GTG TAC CCC GAG AAC GGC GTG CTG TGC GGC CGG AAC GTG ATG GCC CTG 1198 

380 VYPENGVLCGRNVMAL 395 

1199 AAG GTG GGC GAC CGG CGG CTG ATC TGC CAC CAC TAC ACC AGC TAC CGG 1246 

396 KVGDRRLICHHYTSYR 411 

124 7 AGC AAG AAG GCC GTG CGG GCC CTG ACC ATG CCC GGC TTC CAC TTC ACC 1294 

412 SKKAVRALTMPGFHFT 427 

1295 GAC ATC CGG CTG CAG ATG CTG CGG AAG GAG AAG GAC GAG TAC TTC GAG 1342 

428 DIRLQMXiRKEKDEYFE 443 

1343 CTG TAC GAG GCC AGC GTG GCC CGG TAC AGC GAC CTG CCC GAG AAG GCC 13 90 

444 LYEASVARYSDLPEKA 459 

1391 AAC TGA 
460 N * 



(SEQ ID NOS. 15 & 16) 



Figure 13 

Cr-449- tandem- act in (4 -amino acid linker between Cr-449 monomers is noted in 
double underline; 4 -amino acid linker between second Cr-449 and actin is noted 
in dashed underline) . 

1 A CCG GT C GCC ACC ATG GTG AGC GGC CTG CTG AAG GAG AGC ATG CGC 46 
1 Agel MVSGLLKESMR 11 

47 ATC AAG ATG TAC ATG GAG GGC ACC GTG AAC GGC CAC TAC TTC AAG TGC 94 
12 IKMYMEGTVNGHYFKC 27 

95 GAG GGC GAG GGC GAC GGC AAC CCC TTC GCC GGC ACC CAG AGC ATG CGG 142 
28EGEGDGNPFAGTQSMR 43 

143 ATC CAC GTG ACC GAG GGC GCC CCC CTG CCC TTC GCC TTC GAC ATC CTG 190 
44IHVTEGAPLPFAFDIL 59 

191 GCC CCC TGC TGC GAG TAC GGC AGC AGG ACC TTC GTG CAC CAC ACC GCC 23 8 
60APCCEYGSRTFVHHTA 75 

23 9 GAG ATC CCC GAC TTC TTC AAG CAG AGC TTC CCC GAG GGC TTC ACC TGG 286 
76EIPDFFKQSFPEGFTW 91 

2 87 GAG AGA ACC ACC ACC TAC GAG GAC GGC GGC ATC CTG ACC GCC CAC CAG 334 
92 ERTTTYEDGGILTAHQ 107 

3 35 GAC ACC AGC CTG GAG GGC AAC TGC CTG ATC TAC AAG GTG AAG GTG CTG 382 
108 DTSLEGNCLIYKVKVL 123 

3 83 GGC ACC AAC TTC CCC GCC GAC GGC CCC GTG ATG AAG AAC AAG AGC GGC 4 30 

124 GTNFPADGPVMKNKSG 139 

431 GGC TGG GAG CCC AGC ACC GAG GTG GTG TAC CCC GAG AAC GGC GTG CTG 4 78 

140 GWEPSTEVVYPENGVL 155 

479 TGC GGC CGG AAC GTG ATG GCC CTG AAG GTG GGC GAC CGG CGG CTG ATC 526 

156 CGRNVMALKVGDRRLI 171 

52 7 TGC CAC CAC TAC ACC AGC TAC CGG AGC AAG AAG GCC GTG CGG GCC CTG 5 74 

172 CHHYTSYRSKKAVRAL 187 

5 75 ACC ATG CCC GGC TTC CAC TTC ACC GAC ATC CGG CTG CAG ATG CTG CGG 622 

188 TMPGFHFTD1RLQMLR 203 

623 AAG GAG AAG GAC GAG TAC TTC GAG CTG TAC GAG GCC AGC GTG GCC CGG 670 

204 KEKDEYFELYEASVAR 219 

671 TAC AGC GAC CTG CCC GAG AAG GCC AAC AGA__TCT_CCC_GGG ATG GTG AGC 718 

220 YSDLPEKANRSPGMVS 235 

719 GGC CTG CTG AAG GAG AGC ATG CGC ATC AAG ATG TAC ATG GAG GGC ACC 766 

236 GLLKESMRIKMYMEGT 251 

767 GTG AAC GGC CAC TAC TTC AAG TGC GAG GGC GAG GGC GAC GGC AAC CCC 814 

252 VNGHYFKCEGEGDGNP 267 

815 TTC GCC GGC ACC CAG AGC ATG CGG ATC CAC GTG ACC GAG GGC GCC CCC 862 

268 FAGTQSMRIHVTEGAP 283 



Figure 13 (continued) 

863 CTG CCC TTC GCC TTC GAC ATC CTG GCC CCC TGC TGC GAG TAC GGC AGC 910 

284 LPFAFDILAPCCEYGS 299 

911 AGG ACC TTC GTG CAC CAC ACC GCC GAG ATC CCC GAC TTC TTC AAG CAG 958 

300 RTFVHHTAEIPDFFRQ 315 

959 AGC TTC CCC GAG GGC TTC ACC TGG GAG AGA ACC ACC ACC TAC GAG GAC 1006 

316 SFPEGFTWERTTTYED 331 

1007 GGC GGC ATC CTG ACC GCC CAC CAG GAC ACC AGC CTG GAG GGC AAC TGC 1054 

332 GGILTAHQDTSLEGNC 347 

1055 CTG ATC TAC AAG GTG AAG GTG CTG GGC ACC AAC TTC CCC GCC GAC GGC 1102 

348 LIYKVKVLGTNFPADG 363 

1103 CCC GTG ATG AAG AAC AAG AGC GGC GGC TGG GAG CCC AGC ACC GAG GTG 1150 

364 PVMKNKSGGWEPSTEV 379 

1151 GTG TAC CCC GAG AAC GGC GTG CTG TGC GGC CGG AAC GTG ATG GCC CTG 1198 

380 VYPENGVLCGRNVMAL 395 

119 9 AAG GTG GGC GAC CGG CGG CTG ATC TGC CAC CAC TAC ACC AGC TAC CGG 1246 

396 KVGDRRLICHHYTSYR 411 

124 7 AGC AAG AAG GCC GTG CGG GCC CTG ACC ATG CCC GGC TTC CAC TTC ACC 12 94 

412 SKKAVRALTMPGFHFT 427 

12 95 GAC ATC CGG CTG CAG ATG CTG CGG AAG GAG AAG GAC GAG TAC TTC GAG 1342 

428 DIRLQMLRKEKDEYFE 443 

1343 CTG TAC GAG GCC AGC GTG GCC CGG TAC AGC GAC CTG CCC GAG AAG GCC 1390 

444 LYEASVARYSDLPEKA 459 

1391 AAC AGA ACT CGA GCT ATG GAT GAT GAT ATC GCC G. . . 1424 

460 N R T R A M D D D I A... 470 

actin 

{SEQ ID NOs. 17 & 18) . 




Fluor TexRed 




Figure 15 



Heteractis crispa chromoprotein hcCP mut C148S 

C148S according to GFP numbering 
C143S according to self-numbering. 

ATGGCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATGTACAT 
MAGLLKESMRIKMYM 

GGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAACCCATT 
EGTVNGHYFKCEGEGDGNPF 

TACAGGTACGCAGAGCATGAGGATTCATGTCACCGAAGGGGCTCCATTACCATTTGCCTT 
TGTQSMRIHVTEGAPLPFAF 

CGACATTTTGGCACCGTGTTGTGAGTACGGCAGCAGGACCTTTGTCCACCATACGGCAGA 
DILAPCCEYGSRTFVHHTAE 

GATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACCACAAC 
IPDFFKQSFPEGFTWERTTT 

CTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAACTGCCT 
YEDGGILTAHQDTSLEGNCL 

TATATACAAGGTGAAAGT CCTTGGT ACCAATTTTCCTGCTGATGGC C C CGTGATGAAGAA 
IYKVKVLGTNFPADGPVMKN 

CAAATCAGGAGGATGGGAGCCAAGCACTGAGGTGGTTTATCCAGAGAATGGTGTCCTGTG 
KSGGWEPSTEVVYPENGVLC 

TGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCTCTATAC 
GRNVMALKVGDRRLICHLYT 

TTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTTACAGA 
SYRSKKAVRALTMPGFHFTD 

CATCCGCCTTCAGATGCCGAGGAAAAAGAAAGACGAGTACTTTGAACTGTACGAAGCATC 
IRLQMPRKKKDEYFELYEAS 

TGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGA 
VARYSDLPEKAN* 
SEQ ID NO: 23 & 24 



Figure 16 



Crispa 44-6 mutant possesses six amino acid substitutions vs. wild type: 
A2S/T36A,A65E ? C143S ? L173H,P201L. 

TCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATGTACAT 
SGLLKESMRIKMYM 

GGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAACCCATT 
EGTVNGHYFKCEGEGDGNPF 

TGCAGGTACGCAGAGCATGAGGATTCATGTCACCGAAGGGGCTCCATTACCATTTGCCTT 
AGTQSMRIHVTEGAPLPFAF 

CGACATTTTGGCACCGTGTTGTGCGTACGGCAGCAGGACCTTTGTCCACCATACGGCAGA 
DILAPCCAYGSRTFVHHTAE 

GATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACCACAAC 
IPDFFKQSFPEGFTWERTTT 

CTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAACTGCCT 
YEDGGILTAHQDTSLEGNCL 

M 

TATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATGAAGAA 
IYKVKVLGTNFPADGPVMKN 

CAAATCAGGAGGATGGGAGCCAAGCACTGAGGTGGTTTATCCAGAGAATGGTGTCCTGTG 
KSGGWEPSTEVVYPENGVLC 



TGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCACTATAC 
GRNVMALKVGDRRLI CHHYT 

TTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTTACAGA 
SYRSKKAVRALTMPGFHFTD 



CATCCGCCTTCAGATGCTGAGGAAAAAGAAAGACGAGTACTTTGAACTGTACGAAGCATC 
IRLQMLRKKKDEYFELYEAS 

TGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGA 

VARYSDLPEKAN* 
SEQ ID NO: 25 & 26 
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Heteractis crispa chromoprotein wild type (base isoform) 

10 20 30 40 50 60 

5 f ACCATTTGCTTTGGTTCCTTGGCAAACGAAAGTTTAGAACGAAAACTGACCCAAATTACA 
70 80 90 100 110 120 

TCTTCCTCCTGGATCCTTACCATGGCTGGTTTGTTGAAAGAAAGTATGCGCATCAAGATG 

MAGLLKESMRIKM 

130 140 150 160 170 180 

TACATGGAAGGCACGGTTAATGGCCATTATTTCAAGTGTGAAGGAGAGGGAGACGGCAAC 
YMEGTVNGHYFKCEGEGDGN 

190 200 210 220 230 240 

CCATTTACAGGTACGCAGAGCATGAGGATTC ATGT CAC CGAAGGGGCTCCATTACCATTT 
PFTGTQSMRIHVTEGAPLPF 

250 260 270 280 290 300 

GCCTTCGACATTTTGGCACCGTGTTGTGAGTACGGCAGCAGGACCTTTGTCCACCATACG 
AFDILAPCCEYGSRTFVHHT 

310 320 330 340 350 360 

GCAGAGATTCCCGATTTCTTCAAGCAGTCTTTCCCTGAAGGCTTTACTTGGGAAAGAACC 
AEIPDFFKQSFPEGFTWERT 

370 380 390 400 410 420 

ACAACCTATGAAGATGGAGGCATTCTTACTGCTCATCAGGACACAAGCCTGGAGGGGAAC 
TTYEDGGILTAHQDTSLEGN 

430 440 450 460 470 480 

TGCCTTATATACAAGGTGAAAGTCCTTGGTACCAATTTTCCTGCTGATGGCCCCGTGATG 
CLIYKVKVLGTNFPADGPVM 

490 500 510 520 530 540 

AAGAACAAATCAGGAGGATGGGAGCCATGCACTGAGGTGGTTTATCCAGAGAATGGTGTC 
KNKSGGWEPCTEVVYPENGV 

550 560 570 580 590 600 

CTGTGTGGACGTAATGTGATGGCCCTTAAAGTCGGTGATCGTCGTTTGATCTGCCATCTC 
LCGRNVMALKVGDRRLICHL 

610 620 630 640 650 660 

TATACTTCTTACAGGTCCAAGAAAGCAGTCCGTGCCTTGACAATGCCAGGATTTCATTTT 
YTSYRSKKAVRALTMPGFHF 

670 680 690 700 710 720 

ACAGACATCCGCCTTCAGATGCCGAGGAAAACGAAAGACGAGTACTTTGAACTGTACGAA 
TDIRLQMPRKTKDEYFELYE 

730 740 750 760 770 780 

GCATCTGTGGCTAGGTACAGTGATCTTCCTGAAAAAGCAAATTGATTGTTCCCAGTGACA 
ASVARYSDLPEKAN* 

790 800 810 820 830 840 

CCAGACTGCTGTCAGCTTTTGGTTAAAGCCCGAAAGACAAAAGGACATTTGTAGTTTAGT 

850 860 870 880 890 900 

TTATATTTCCCTTTCATTTGTGAATCAACATTGTACTCTCTGTAAACCTTTAAAATGCTC 

910 

CATTAAACCT 3' (SEQ ID NOs : 27 & 28) 



